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Abstract 

Performance of reliable communication over a coherent slow fading MIMO channel at 
high SNR is succinctly captured as a fundamental tradeoff between diversity and multiplex¬ 
ing gains. We study the problem of designing codes that optimally tradeoff the diversity 
and multiplexing gains. Our main contribution is a precise characterization of codes that 
are universally tradeoff-optimal, i.e., they optimally tradeoff the diversity and multiplexing 
gains for every statistical characterization of the fading channel. We denote this charac¬ 
terization as one of approximate universality where the approximation is in the connection 
between error probability and outage capacity with diversity and multiplexing gains, re¬ 
spectively. The characterization of approximate universality is then used to construct new 
coding schemes as well as to show optimality of several schemes proposed in the space-time 
coding literature. 


1 Introduction 

Reliable communication over slow fading point-to-point channels, where the (random) channel 
realization is fixed over the time scale of communication, is characterized by the tradeoff between 
data rate and error probability: typical fading distributions have a nonzero probability of being 
very small and thus arbitrarily reliable communication is not possible at any non-zero rate. The 
tradeoff between the data rate and the error probability is captured by the outage capacity, the 
largest rate of reliable communication for a fixed error probability. The information theoretic 
view is that of a compound channel; the slow fading channel is composed of a class of channels 
parameterized by the different channel realizations that are not in outage. The outage capacity 
is achieved by universal codes, those that work reliably over every one of the channel realizations 
not in outage. 

At high SNR, the precise (but too involved to derive code design principles) tradeoff between 
error probability and data rate is coarsely captured in terms of a tradeoff between diversity and 
multiplexing gains these are the rate of decay of error probability and the increase of data 
rate with increasing SNR. Since the tradeoff is captured at a coarser scale, we shall denote 
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codes that optimally tradeoff diversity and multiplexing gains for every slow fading channel 
as approximately universal] the approximation here refers to the coarseness in the definition of 
diversity and multiplexing gains as opposed to studying error probability and data rate directly. 
Our main result is a precise characterization of approximately universal codes. We use this 
characterization to show the approximate universality of some codes proposed in the literature 
and to also construct new space-time codes that are approximately universal. These codes 
are robust to statistical channel modeling errors, hence their engineering appeal is clear. This 
approach of using compound channel viewpoint to construct robust codes for MIMO channels 
has also been taken in a series of works in miHiii- 

We are interested in codes that achieve reliable communication over all channel realizations 
not in outage: this suggests, as done in asking for the performance of the code for the 
worst channel not in outage. This is in contrast to the traditional performance analysis where 
the error probability is averaged over the statistics of the fading channel. In particular, if the 
worst-case pairwise error probability decays exponentially with increasing SNR then such a 
code is approximately universal. For a parallel channel, the worst channel for a given pair of 
codewords is “inverse waterfilling” over the pairwise squared codeword differences. For a MIMO 
channel, the worst channel (derived in j^) aligns its singular vectors in the same directions as 
those of the pairwise codeword difference matrix and then the singular values inverse waterfill 
the singular values of the pairwise codeword difference matrix. While the exact expression of 
the worst-case pairwise error is somewhat involved, a simple worst-case code design criterion 
emerges at high SNR for both the parallel channel and the MIMO channel. 

For a parallel channel, somewhat surprisingly, the worst-case code design criterion at high 
SNR simplifies to the product distance criterion which was derived initially for the i.i.d. Ricean 
fading channel [5], though is better known for the i.i.d. Rayleigh fading channel (see Chapter 3 
of [SI)- In a compound channel setting the criterion was heuristically derived in |2], here we 
give a more precise statement for the criterion. In particular, we show that if the products of 
all normalized squared codeword differences is larger than 2“^ where R is the communication 
rate, then the code is approximately universal. This design criterion suggests a class of codes 
based on permutations of the QAM (quadrature amplitude modulation) constellation that we 
call permutation codes. Even random permutation codes are approximately universal and we 
provide examples of simple and explicit permutation codes that are approximately universal. 
We show that a code based on a rotated QAM constellation proposed in the literature |7| also 
satisfies the desired product distance property and is hence approximately universal. 

For a MIMO channel, the worst-case code design criterion is in general not simply to max¬ 
imize the determinant of the codeword difference matrix, the criterion derived for the i.i.d. 
Rayleigh fading channel |H]. This can be explicitly seen in the case of the multiple transmit 
but single receive antenna (MISO) channel: the worst channel chooses the most susceptible 
direction to confuse between a pair of codeword matrices - this is the direction of the smallest 
singular value of the codeword difference matrix. Thus the worst-case code design criterion 
for the MISO channel is to maximize the smallest singular value of the codeword difference 
matrix; different from the determinant criterion derived for the i.i.d. Rayleigh fading channel. 
More generally, the worst-case code design criterion at high SNR for a MIMO channel (with 
nt transmit and Ur receive antennas) is to maximize the product of the smallest min(nt,nr) 
singular values of the codeword difference matrix. With more receive than transmit antennas, 
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the worst-case code design criterion reduces to the determinant criterion derived for the i.i.d. 
Rayleigh fading channel. 

An important implication of our worst-case code design criterion is the following: if a code is 
approximately universal on an nt x nt MIMO channel, then it is also approximately universal for 
nt X Ur MIMO channel for every . Several space-time codes proposed in the literature satisfy 
the worst-case code design criterion and are hence approximately universal. In particular, the 
QAM rotation codes in [3 El are approximately universal for every MIMO channel with two 
transmit antennas. The recently proposed codes in uniiniini that are derived from cyclic 
division algebra are also approximately universal. 

V-BLAST |RI| and D-BLAST |14j are classical architectures for communication over a 
MIMO channel. While they are not approximately universal, we show that they are tradeoff 
optimal in some rate regime universally over a (restricted) class of channels which are rotation- 
ally invariant. In particular, this class of channels includes the i.i.d. Rayleigh fading channel: we 
show that V-BLAST with simple QAM constellations as the independent data streams achieves 
the last segment of the tradeoff curve for the n x n i.i.d. Rayleigh fading MIMO channel and 
D-BLAST achieves the first segment of every n* x 2 i.i.d. Rayleigh fading MIMO channel. These 
results are illustrated in the context of a 2 x 2 i.i.d. Rayleigh fading MIMO channel in Figure^ 



Spatial multiplexing gain: r = R/ log{SNR) 


Figure 1: Tradeoff curves: nt = rir = 2 


We have organized this paper into two distinct parts: first, we present a precise charac¬ 
terization of approximate universal codes for the general MIMO channel. In the second part, 
we discuss explicit approximately universal codes, starting with simpler channel models and 
moving on to the more involved ones. In particular, we start with the scalar channel and show 
that a simple QAM is approximately universal (this is done in Section il). Next, we study the 
parallel channel and the MISO channel in Sections [5] and El respectively. Finally we consider the 
general MIMO channel in Section [3 by demonstrating the approximately universality of some 
codes proposed in the literature, and then analyzing the approximately universal performance 
of V-BLAST and D-BLAST in Sections |H1 and El respectively. 
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2 Channel Model and The Outage Formulation 

The main focus of this paper is on the slow fading (point-to-point) MIMO channel 

y[m] = Hx[m] -|-w[m], (1) 


where m is the time index and y and x denote the output and the input vectors respectively. 
The complex Ur x nt matrix H of fading gains is randomly picked, but stays constant over 
the time-scale of communication; we suppose that the exact realization of H is known at the 
receiver. The additive noise w has i.i.d. complex Gaussian (CAA(0,1)) entries. We are interested 
in one-shot communication over this channel over a (small) length of time T. There is a transmit 
power constraint of Tn^SNR for any transmit codeword of length T. 

In this paper, we focus on the scaling at high SNR introduced in the data rate is 
measured on a scale of log SNR and the decay rate of error probability is measured on a scale 
which is a negative exponent of SNR. All logarithms in this paper are to the base 2. More 
precisely, the multiplexing and diversity gains are defined as follows. A sequence of coding 
schemes (sequence in SNR) achieves a multiplexing rate of r and diversity gain of d if 


R{sm) logPe(SNR) 

km —?rrT 7 r = r, and hm —^— = —d. 
SNR^oologSNR SNR^oo logSNR 


where i?(SNR) is the rate of the scheme and Pe(SNR) is the probability of error with maximum 
likelihood (ML) decoding for the scheme. For a given multiplexing gain r, the largest diversity 
gain supported by any coding scheme is denoted by d*{r). The goal is to find a characterization 
of this optimal diversity-multiplexing tradeoff, d*{r), for any correlated channel and then to 
find (simple) coding schemes with as small a block length (T) as possible that achieve this 
optimal tradeoff curve. 

The outage event turns out to be closely related to the problem of characterizing d*{r). It 
is defined as the set of channel realizations for which the mutual information is below the data 
rate: 

{H :/(x;Hx + w|H) < i?}, (2) 

where the input distribution is independent of the realization of H. It is shown in pQ that, in 
the scale of interest, the input distribution Px can be taken to be i.i.d. complex Gaussian for the 
Rayleigh fading channel; a similar argument for any fading distribution shows that the input 
distribution can be taken to be i.i.d. complex Gaussian. This means that the outage curve can 
be defined as: 


dout (^) 


, -logP{H : logdet(I-FSNRHH*) < rlogSNR} 

lim -; ——- 

SNR^oo log SNR 


(3) 


The outage curve dout(?’) is an upper bound to d*{r) p. On the other hand, the set 
of channel realizations that are not in outage constitute a compound channel, the capacity of 
which is r log SNR. The compound channel coding theorem guarantees the existence of universal 
codes: codes that achieve reliable communication over every MIMO channel realization that is 
not in outage. This means, that by coding over possibly long block lengths, one can actually 
achieve the outer bound of dout(^)- Therefore for the rest of this paper, we identify the outage 
curve with the optimal diversity-multiplexing tradeoff curve. Note that, we are mainly interested 
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in fading distributions such that the eigen-values are not bounded away from zero (e.g. AWGN 
channel can be considered as a fading channel). Otherwise, the outage curve will be infinite, 
and an approximately universal code will achieve it. But, the diversity-multiplexing tradeoff is 
not the right setup to study this problem. 

We are interested in universal codes that achieve the upper bound of dout(^) only to the 
extent that they are tradeoff-optimal; we call such codes approximately universal. Our main 
focus is on a characterization of approximately codes with small block-length. 


3 Main Result 


Our main result is a precise characterization of approximately universal codes. 


Theorem 3.1. A sequence of codes of rate i?(SNR) bits/symbol is approximately universal over 
the MIMO channel if and only if, for every pair of codewords, 


xlxl 


A 


> 


mm(n^,nt) — 2ii(SNR)+o(log(SNR)) ' 


(4) 


where Ai,..., are the smallest mm{nr,nt) singular values of the normalized (by 

codeword difference matrix. 


For > nt, 0 turns out to be the same as the “nonvanishing determinant” criterion 
introduced in the context of i.i.d. Rayleigh fading channels in This criterion was also 

studied in izmni, also in the context of i.i.d. Rayleigh fading channels. In [7j, it was shown that 
for two transmit antennas, if a code satisfies this nonvanishing determinant criterion, then it is 
tradeoff-optimal for the i.i.d. Rayleigh fading channel; this result has been recently generalized 
for artibtrary number of transmit antennas in m- 

Our result is much stronger: if a code satisfies the nonvanishing determinant criterion, 
then it is tradeoff-optimal for every fading distribution. Thus, our result gives the well-known 
determinant criterion a precise operational interpretation in terms of approximate universal¬ 
ity. Through this characterization, we will see that codes with small block lengths can be 
approximately universal. We start with a few implications of this criterion and then prove the 
sufficiency part of the criterion. The necessity part is proved in Appendix 0 


3.1 Approximately Universal Codes in the Downlink 

Some interesting observations follow from our characterization of approximately universal codes. 

• If a code is approximately universal over an nt x Ur MIMO channel with >nt, i.e., the 
number of receive antennas is equal to or larger than the number of transmit antennas, 
then it is also approximately universal for an n* x / MIMO channel with I > m. 

• The singular values of the normalized codeword difference matrices are upper bounded 
by a fixed number {^/nfr). Thus, a code that is approximately universal over an nt x nr 
MIMO channel is also approximately universal over an x / MIMO channel with I < nr. 

• Consider the downlink of a cellular system where the base stations are equipped with 
multiple transmit antennas. Suppose we want to broadcast common information to all 
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the users in the cell. We would like our transmission scheme to not depend on the 
number of receive antennas at the users: each user could have a different number of 
receive antennas, depending on the model, age, and type of the mobile device. Universal 
MIMO codes provide an attractive solution to this problem. Suppose we broadcast the 
common information at rate R using an approximately universal space time code over an 
rit X Tit MIMO channel. Since this code is approximately universal for every nt x Ur MIMO 
channel, the diversity seen by each user is simultaneously the best possible at rate R. To 
summarize: the diversity gain obtained by each user is the best possible with respect to 
both. 


— the number of receive antennas the user has, and 

— the statistics of the fading channel the user is currently experiencing. 

3.2 Characterization of approximately universal codes 

Towards our goal of characterizing approximately universal codes, we first calculate the pairwise 
error probability for a pair of codewords based on the worst channel realization not in outage, 
i.e., we consider the realization (not in outage) as a function of the specific pair of codewords 
so as to yield the worst pairwise error probability. If this worst-case pairwise error probability 
decays exponentially with SNR for every pair of codewords (we allow the worst channel to change 
as a function of the pair of codewords), then a simple union bound argument shows that the 
error probability conditioned on the channel realization not in outage decays exponentially with 
SNR: the total number of codewords is only polynomial in SNR; for example if the multiplexing 
rate is r, the rate is i? = r log SNR and the total number of codewords is SNR^. Since the error 
probability is lower bounded by the outage probability, we arrive at a sufficient condition for 
approximate universality of a code: 

the worst-case (over channels not in outage) pairwise error probability for every pair 
of codewords should decay exponentially with SNR. 

It turns out that this condition is necessary as well; thus we have an exact characterization of 
approximately universal codes. 

In Section Id. 2. II we derive an expression for the worst-case pairwise error for a pair of 
codewords. This derivation allows us to explicitly characterize approximate universality of a 
code in terms of a condition on its pairwise difference codewords. It is fruitful to contrast 
our approach with the traditional “code design criterion” for space-time codes in the literature 
where the pairwise error probability is averaged over the channel statistics. This criterion 
indeed depends on the specific channel statistics being considered. This is in stark contrast to 
the worst-case analysis we have proposed; the corresponding “universal code design criterion” 
does not depend on the channel statistics and characterizes properties of a universal code: the 
engineering appeal of the universal code design criterion is natural; modeling channel statistics 
is a bit of an “art” in practice and it is useful to have a code that is robust to a variety of 
channel statistics. 

The classical code design criterion for the i.i.d. Rayleigh fading channel is the determinant 
criterion] as we will see in Section [11.2.II the universal code design criterion at any specific SNR 
is quite different from the determinant criterion. However, it is also somewhat involved and 
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is not directly suited to verify or to design approximately universal codes. In Section ld.2.dl 
we derive a simplified condition for approximate universality taking the high SNR scaling into 
consideration and this high SNR criterion is indeed very closely related to the determinant 
criterion. 


3.2.1 Worst-case Pairwise Error Probability 

Our approach is to study the worst-case pairwise error probability of the code over MIMO chan¬ 
nel realizations not in outage. The pairwise error probability between two codeword matrices 
and Xb (of length T > nt), conditioned on a specific realization of the MIMO channel H, 
is 

g , (5) 

where D is the normalized codeword difference matrix 


D = 




(X. 


Xs). 


Expanding the channel and codeword difference matrices using the singular value decomposition 
(SVD), 

H := and D := UaAV^, (6) 

the pairwise error probability in can be rewritten as 


Q 


'SNR, 


'EVJU2AI 


(7) 


Suppose the singular values are increasingly ordered in A and decreasingly ordered in 


Tf := diagjV'i,... ,V’minK,nt),0,... ,0} , and A := diag {Ai,..., A„J . 

Then the worst-case rotation can be determined and it turns out to be the one that aligns the 
weaker singular values of the channel matrix with the stronger singular values of the codeword 
difference matrix j2j . More precisely, the channel eigen-directions Vi that maximize the pairwise 
error probability in (TTl) is IHl 

Vi = U2. (8) 

Now, the no-outage condition is only a condition on the non-zero min(n,.,nt) singular values 
of the fading matrix and is given by: 


min(nr,nt) 

log(l + SNR|V’,|2) >R. (9) 

£=l 


Hence the worst-case pairwise error probability for the MIMO channel reduces to the optimiza¬ 
tion problem 


min 

Ipl ;...,hmin(n7. ,71^) 


SNR 

2 


mm{nr,nt} 

E 


e=i 




( 10 ) 
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subject to the constraint in ©• 

If we define := SNR • then the optimization problem can be rewritten as 


1 


mm — 

Ql>0,---,Qiaiii(nr,nt)>0 2 


min(nr,nt) 


1=1 


subject to the constraint 


min(nr ,nt) 


£=1 



> R. 


This is the dual of the problem of minimizing the total power required to support a target rate 
R bits/symbol per sub-channel over a parallel Gaussian channel; the solution is just standard 
waterhlling, and is given by 


min 

Here A is the Lagrange multiplier chosen such that the channel in m satisfies with equality. 
The worst-case pairwise error probability is 


{nr,nt). (11) 


Qi := SNR 




where A satishes: 


Q 


\ 


rain{nr,nt) , 

1 E ()-iv 


e=i 


A 


mm(nr,nt) 

E 


e=i 



+ 

= R. 


( 12 ) 


(13) 


For convenience, we denote the argument of the Q function at the worst-case channel 

realization as the universal code construction criterion for the given difference codeword pair. 
In general, the goal is to maximize this universal code construction criterion: 


min(nr,nt) 


E 

£=1 




(14) 


3.2.2 A Closer Look at the Universal Criterion 


To get a feel for the universal criterion in d, consider the simple case when codeword differ¬ 
ence eigenvalues have the same magnitude, i.e., |Ai| = ••• = |A,iJ. Then A can be explicitly 
calculated: 


1 

A 


2^/ min(nr,nt 



Thus the universal criterion is given by 


mm{nr,nt) _ l) |Ai|2, 
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a simple function of the magnitude of the normalized codeword difference. To understand the 
situation in general, let us suppose without any loss of generality that |Ai| < • • • < 

Now consider the largest k such that 

|Afc|2<2^/'^|Ai---Afc|2/^<|Afc+i|2, (15) 

with defined as +oo. Then A can be calculated explicitly: 

i = 2«/^|Al•••Afc|2/^ (16) 

satisfies jinj. Thus the universal code design criterion turns out to be 

^A:(2^|AiA2---Afc|2)'/"-^|A,|2^ , (17) 

a combination of the geometric and arithmetic means of the magnitudes of the k smallest 
singular values of normalized codeword differences. While this calculation sheds some insight 
into the nature of the universal code design criterion, it still does not lend itself to designing 
or verifying approximately universal codes. Towards making this expression more amenable to 
code design, we would like to develop a high SNR approximation; this is done next. 

3.2.3 Proof of Theorem ism 

Our goal here is to show that for a sequence of codes satisfying Q, the probability of error 
has the same decay rate as that of the outage probability for all fading distributions. The 
probability of error can be upper bounded using a smart union bound (as in ^): 

IPe < P{0} + P(error, O'^). (18) 

Here we have denoted the outage event by O. Similar to the union bound, the second term 
can be upper bounded by a sum of pairwise errors averaged over all channel realizations not 
in O. This sum can be further upper bounded by the sum of the worst-case (over all channel 
realizations not in O) pairwise error probabilities. For the probability of error to behave like 
the probability of outage for every fading distribution, we require the second term in (ITHl) to 
decay exponentially in SNR (= for some 5 > 0). One way to do this is to make every 

worst-case pairwise error decay exponentially in SNR. 

Instead of considering a single outage event, we consider a sequence of outage events Og, 
parameterized by e > 0: the channel realizations not in are those that are strictly inside the 
no-outage region: 

min(nr,nt) 

log(l + |V’£pSNR) >R(l + e). 

i=i 

For a pair of codewords, the worst-case pairwise error probability is (Q 
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where A satisfies (see (ESI)) 


min(n^,nt) 

E 


i=\ 



+ 


i2(l + e). 


(19) 


Since the codeword differences satisfy the condition in A can be explicitly calculated (see 
m and m) 

J = (|Al| ••• ■ (20) 

Thus the worst-case pairwise error probability can be upper bounded by (see (El)): 


Q 


min 


inr,nt) 2«(i+^) (|Ai| • • • |A^i,(„^,„,)|) - ZT=i 


mm{nr,nt) 


lA.p 


^/2 


( 21 ) 


Again using the supposition in (^, the first term in (EU) is growing unbounded with increasing 
SNR, while the second term in (EU is bounded above by 2ntT (a constant) because of the 
power constraint. Thus, the second term can be ignored for increasing SNR and we can write 
the following upper bound to the worst-case pairwise probability of error (using ©) 


(S) ^ (^) ■ 

With R = r log SNR, we conclude that the pairwise error probability conditioned on the channel 
realization not in decays exponentially with SNR. Since the number of codewords is poly¬ 
nomial in SNR, the overall error probability conditioned on the channel realization not in Oe 
decays exponentially with SNR. Thus the error probability decays at the same rate as PjOc}. 
Letting e become arbitrarily close to zero, this decay rate can be made arbitrarily close to 
that of the outage probability. Thus the sequence of codes achieves the optimal tradeoff curve, 
and further for every fading distribution, we conclude that the sequence of codes satisfying 
is approximately universal. This completes the sufficiency part of Theorem KLII necessity is 
proved in Appendix ^ Next, we discuss some explicit schemes that are approximately uni¬ 
versal, starting with the simple scalar channel and then moving onto more complex channel 
models. 


4 QAM is Approximately Universal for the Scalar Channel 

The single antenna (transmit and receive) channel model can be written as (dropping the time 
index): 


y = hx + w. 


The criterion for approximate universality (cf. Theorem KLlI) simply translates into a minimum 
distance one for the code: 


d ■ > - 

“mm ^ 2^(SNR)-l-o(logSNR) ’ 


( 22 ) 
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where dmm is the normalized minimum distance over all the codeword pairs for the coding 
scheme. Now, consider a simple coding scheme with unit block length: QAM of size 2^. The 
normalized minimum distance of this QAM has the property 


^min ’ 

and is therefore approximately universal for the scalar fading channel. 


5 The Parallel Channel 

The parallel fading channel with L diversity branches at time m is 

yi[m] = hiXi[rn] + we[m], (23) 

Here wi[m], ..., WL[m] are i.i.d. CM (0,1). The approximate universality criterion for the par¬ 
allel channel is stated in the following theorem. The proof is very much similar to the general 
approximate universality proof in 13.2.31 hence we omit it here. 

Theorem 5.1. A sequence of codes with rate i2(SNR) bits/symbol is approximately universal 
if and only if, for every pair of codewords, the normalized codeword differences di,..., (the 
rows of the difference codeword matrix) satisfy 

■ ll^2|| •••||dL|| > 2-R(SNR)+o(logSNR) ■ 

In the rest of the section, we study a simple class of codes that are approximately universal. 
Our main focus is on unit block length codes based on permutations of a QAM constellation 
that we call permutation codes.^ We show in Section f5 . 21 that even a random permutation code 
is approximately universal; thus space-only approximately universal codes exist. Finally, we 
demonstrate simple examples of approximately universal permutation codes: these codes are 
easy to represent (so the storage complexity is low) and very easy to encode and decode (so the 
run time complexity is small as well). The parallel channel with two sub-channels is studied 
in Section 15.31 where a bit-reversal permutation is shown to be approximately universal; this 
scheme also provides an operational interpretation to the outage condition (defined based on 
an information theoretic underpinning) of the parallel channel. Simple permutation codes for 
the parallel channel with more than two sub-channels are the topic of Section 15.41 

5.1 Approximate Universality of Codes Based on Rotation of PAM 

The criterion of maximizing the product-distance has been known in the context of the i.i.d. 
Rayleigh fading channel. A code construction based on rotations of PAM constellations is 
discussed in m-- the transmit codeword vector x := [xi, • • • ,xl] is defined as 

X = uM, (25) 

where ui,... ,ul are independent PAM constellations and M is an orthonormal matrix. m 
shows existence of M such that the code has the maximum diversity possible, i.e., a non-zero 
^These codes are intimately related to interleaver designs in turbo codes. 
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product distance. The problem of explicitly maximizing the minimum product distance was 
later considered in HH: it was treated as an optimization problem over M for fixed input 
constellations. For L = 2, the M that maximizes the product distance was explicitly found 
using computer simulations. Later, a similar idea of rotating QAM constellations was proposed 
in [7] as a part of the 2x2 code construction. It follows from Theorem 2 in [7] that these codes 
are also in fact approximately universal for the parallel channel. 

Unfortunately, no generalizations of the rotation based codes exist when there are more 
than two sub-channels. Further, these codes are hard to decode for large constellation sizes. 
Therefore, we propose another approach: QAM constellations are the basis of the code design 
but we consider mappings that utilize the algebraic structure of the constellation; these map¬ 
pings are nonlinear with respect to the Euclidean vector space in which the QAM constellations 
are embedded - this is in contrast to the rotation operation which is a linear mapping. 

5.2 Permutation Codes 

We would like to construct simple space-only (i.e., unit block length) approximately universal 
codes. As a step towards simple encoding and decoding, suppose the QAM constellation to be 
the alphabet for each sub-channel. We need to protect every codeword by coding it across every 
sub-channel: for the code to have any chance of being approximately universal, it should allow 
reliable communication for every channel realization not in outage and, in particular, over the 
parallel channel where all but one sub-channel is zero. Two design implications are suggested: 

1. With a rate of R bits/symbol, each of the QAM constellations on the sub-channels has 
2^ points. 

2. With 2'^-point QAM as the alphabet for each sub-channel, the points in the constellation 
over each sub-channel can be identified one-one with points in the constellation of the 
other sub-channels. In other words, the QAM constellation over one sub-channel is a 
permutation of the points in the QAM constellation over any other sub-channel. 

Mathematically, the permutation code can be represented as 

C = {qj2{q),-jL{q)) \q G Qz 

where 

Qz = |(a + ib) 
is the integer-QAM with 2^ points, and / 2 ,. 

5.2.1 Examples 

Repetition coding is a simple example of a permutation code: the permutations are just the 
identity. Figure |21 illustrates the permutation code with identity permutation for L = 2. Here 
Qz is the QAM with 16 points. 

For L = 2, Figure 01 shows a permutation code with 16 codewords that is designed to 
maximize the minimum product distance. Product distance of this code is an improvement over 


H R 

2 2 2 2 

- < a, b < — 

2 “ 2 


(26) 


., /l are permutations of Qz- 
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Figure 2: Repetition coding: L = 2, R 

= 4. 
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Figure 3: Permutation code: L = 2, R = 4 


the repetition code in Figure EJ by a factor of 4. The code in Figure El and its generalization 
to larger L is discussed in m using the theory of spreading transforms. The focus in m is 
on finding codes that have a non-zero product distance and can be efficiently constructed from 
smaller constellations (QPSK) using spreading transforms. 

5.2.2 A Random Permutation Code Ensemble 

Our search for permutation codes that are approximately universal leads us to study permuta¬ 
tions with large QAM alphabet sizes. To get a feel for whether there indeed exist permutation 
codes with large enough product distance, we can look at an appropriate random permutation 
ensemble and see if the product distance averaged over this ensemble of permutation codes has 
the desired property. If this is the case, then there must have been at least one permutation 
code in the ensemble that is approximately universal. Averaging the product-distance itself is 
not good enough; we look at the inverse of the product distance and average it over all possible 
permutation codes with the uniform measure. Our main result is the demonstration of existence 
of permutation codes that are approximately universal: 

Theorem 5.2. There exists a sequence of permutation codes that is approximately universal 
over the parallel channel. 

The details of the proof are relegated to Appendix I bI 
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5.3 Two Sub-channels: Bit-Reversal Permutation Code 

While it is encouraging to know the existence of permutation codes that are approximately 
universal, it is of engineering interest to actually construct simple approximately universal 
codes from this ensemble. It turns out that an operational interpretation of the outage condition 
(which was dehned based on an information theoretic understanding of the compound channel) 
suggests natural permutation codes that are approximately universal. In this section, we focus 
on the special case when the parallel channel has just two sub-channels, i.e., L = 2. 

5.3.1 Operational Interpretation to the Outage Condition 

If we communicate at a rate of R bits/symbol over the parallel channel, the no-outage condition 
is 

log(l + |/ii|2SNR)+log(l + |/i2|2SNR) >R (27) 

One way of interpreting this condition is as though the first sub-channel provides log(l -|- 
I/lips NR) bits of information and the second sub-channel provides log(l -|- |/i 2 pSNR) bits of 
information, and as long as the total number of bits provided exceed the target rate, then 
reliable communication is possible. In the high SNR regime, we exhibit below a permutation 
code that makes the outage condition concrete. 

Suppose we independently code over the I and Q channels of the two sub-channels. So we 
can focus on only one of them, say, the I channel. We wish to communicate R/2 bits over two 
uses of the I-channel. Analogous to the typical event analysis for the scalar channel, we can 
exactly recover all the R/2 information bits from the first I sub-channel alone if: 

ilog(l + |hi|2SNR)>|. 

However, we do not need to use just the first I sub-channel to recover all the information bits; 
the second I sub-channel also contains the same information and can be used in the recovery 
process. Indeed, if we create x\ by treating the ordered R/2 bits as the binary representation 
of the points x\, then one would intuitively expect that if 

ilog(l + |hi|2SNR) > fei, (28) 

then one should be able to recover at least ki of the most significant bits of information. Now, 
if we create x\ by treating the reversal of the R/2 bits as its binary representation, then one 
should be able to recover at least k 2 of the most significant bits, if 

ilog(l + |/i2pSNR) > A:2. (29) 

But due to the reversal, the most significant bits in the representation in the second I sub¬ 
channel are the least significant bits in the representation in the first I sub-channel. Hence, as 
long as ki + k 2 > R/‘2^ then we can recover all R/2 bits. This translates to the condition 

log(l + |hi|2SNR)+log(l + |/i2|2SNR) >R, (30) 

which is precisely the no-outage condition (|77|>- Thus, the bit-reversal scheme gives an opera¬ 
tional meaning to the outage condition. 
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5.3.2 Bit-Reversal Permutation Code 


To make this idea concrete, first we need to define bit reversal. A QAM can be thought of 
as two independent PAMs, and using I and Q channels separately is equivalent to taking the 
QAM permutation as two independent PAM permutations. Therefore we concentrate on one 
of the PAMs and define the bit-reversal permutation for it. For a PAM with 2^/^ points, we 
number the points from left to right by 0 to 2^/^ — 1. Based on this numbering, a canonical bit 
sequence of length R represents each point in the PAM constellation. Bit reversals are defined 
based on this representation. The bit-reversal map for the 4-PAM is illustrated in Figure [I] 



Figure 4: The bit-reversal map for a 4-PAM. 


5.3.3 Product Distance and Bit Reversals 

To show that the bit reversal scheme is approximately universal, we have to show that it 
satisfies the criterion in (EH). However, the plain bit-reversal is not approximately universal. 
The problem is the inherent assumption in the operational interpretation that if two points 
have different MSB, then they are far apart geometrically and hence cannot be confused with 
each other. This, however, is not true. Consider the points with the binary representations: 

011---10 and 100-••01. 

Even though their MSB is different, they are separated by a fixed distance of 3 independent of 
the length R/2 of the binary representation. The same is true for their bit-reversals. Thus, the 
product distance between this codeword pair is and it does not satisfy (I^H) for large R. 

Even though the simple bit-reversal is not optimal, it can be modified so that it essentially 
retains the operational interpretation (so it is still easy to decode) and is approximately univer¬ 
sal. We discuss two such modifications here: irregularly spaced PAM and alternate-bit-flipping. 

5.3.4 Irregularly Spaced PAM Permutation Code 

We have seen that the problem with the bit-reversal scheme is the inherent assumption that 
the two points having different MSB are geometrically far apart. A simple way to get around 
this problem is to put gaps in the PAM constellation. That is, we introduce a gap of 
between Oil • • • 1 and 100 • • • 0 so that any two points with different MSB are indeed far apart. 


15 




More precisely, to retain the operational interpretation, one has to put a gap of g'2™' for every 
bit-change to ensure that the product distance condition is met. The PAM constellation 
is now irregularly spaced.'^ 

Consider any two points in the irregularly spaced PAM constellation. Suppose the first 
MSB they differ in their bit representation is the one: then by construction the normalized 
distance between the two points is lower bounded by 

The bit-reversals of these two points must have the same m — 1 LSBs but a different LSB; 
so the normalized distance between the bit-reversals of these two points is lower bounded by 

2'm—Rf2 

Putting these two together, we conclude that the normalized product distance between a pair 
of codewords in the bit-reversed irregularly spaced permutation code is lower bounded as 

\did2\ > 

^ 9 

2RI2- 

Comparing this with (EH), we conclude that the code is approximately universal. 

A potential drawback of this approach is that the extra gaps translate into an increase in 
the amount of power used for the same rate. Thus, for a PAM of size 2^/^, the normalized 
increase in size is given by 

R/2 R/2 

(number of bit-changes) = Y {2^/^-^), 

m=l m=l 

= 9R/‘2. 

With R = r log SNR, the SNR of this scheme is increased by a factor of (1 -|- log SNR/2). In 
the diversity-multiplexing scaling of our interest, this is an insignificant increase and thus the 
code is still approximately universal. 

5.3.5 Alternate-Bit-Flipping Permutation Code 

Another modification of the plain bit-reversal scheme is to flip every alternate bit after reversing. 
For example, the point in the PAM constellation with bit representation 111111 is mapped to 
the point in the PAM constellation with bit representation 010101. The scheme is illustrated 
for the 4-PAM constellation in Figure [5J 

In general, consider the i?/2-bit representation of integers ai and 02 between 0 and 2^/^ — 1: 

~ ^R/2 ■ ■ ■ ^1) 

^The same idea of introducing gaps is also present in the Cantor set based representation in m 
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Figure 5: Bit Reversals with alternate bits flipped. 

The the alternate-flip bit-reversal map B is defined as (assuming R is even): 

B{ai) = ^ 1^2 ■ ■ ■ 

B{a2) = b1^2 " ' ^%/2 -i^‘r/2- 

An easy observation is that this scheme maintains the integrity of the operational interpretation 
since the decoder can always flip the bits back after estimating the flipped bits. Further, this 
scheme turns out to be approximately universal: 

Theorem 5.3. For every oi and 02 between 0 and 2^/^ — 1; 

|ai- 02 ! |B(ai)- 5 ( 02)1 ^ 1 

2R/2 2^/2 - 8 • 2^/2 ■ ^ ^ 

The details of the proof are somewhat involved and are relegated to Appendix lO 

5.4 Explicit Permutation Codes for General Parallel Channel 

In an effort to generalize the bit-reversal scheme consider the following alternative, but equiv¬ 
alent, view of the same scheme (for L = 2). 

5.4.1 Bit-Reversal as a Linear Operation 

Each codeword in the bit-reversal permutation code is represented by a sequence of, say 2n 
bits. The first n bits correspond to a point in a 2"'-PAM constellation. The corresponding PAM 
constellation point is then transmitted over the I channel of the first sub-channel. The last n 
bits similarly correspond to a point in another 2”-PAM constellation which is then transmitted 
over the Q channel of the first sub-channel. The transmissions over the I and Q channels of the 
second sub-channel are the points in the PAM constellation that correspond to bit-reversals of 
the first and last n bits, respectively, of the total 2n bits that define the codeword. 

If we hx the mapping between the sequence of bits and points in a PAM constellation, the 
bit-reversal scheme can be viewed entirely as an operation on the 2n bits that represent the 
codeword. Further more, if we decide to do the same operation over both the I and Q channels 
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(as in the bit-reversal scheme), then we only need to consider operations over the first n bits 
that represent the codeword. In the rest of this discussion, we consider only the operation on 
the first n bits representing the codeword. The operation involved in bit-reversal is particularly 
simple: it is a linear operation on the vector of bits (over the field F 2 ). Linear operations can be 
represented by matrices and the bit reversal scheme corresponds to two matrices: the identity 
matrix (Ijj) for the first sub-channel and the cross-diagonal matrix with unit entries on the 
cross diagonal (D„) for the second sub-channel. 

The outage interpretation implies that the decoder can deduce ki most signihcant bits 
from the first sub-channel (see (EHl)) and k 2 most significant bits from the second sub-channel 
(see (HH)). Because of the simple mappings in this case, the ki bits from the first sub-channel 
correspond to the first ki bits of the vector of n bits representing the codeword and k 2 bits from 
the second sub-channel that correspond to the last ^2 bits of the vector of n bits representing 
the codeword. As long as ki -\- k 2 > n, the decoder can determine the codeword correctly. 

5.4.2 Universally Decodable Matrices 

This view of the bit-reversal scheme suggests a natural generalization to more than two sub¬ 
channels. We first generalize the bit representation of the integers points of the PAM constel¬ 
lation: we allow g-digit representation over a finite field ¥q. Next we consider a (sequence of) 
collection of L matrices ..., of size nxn with entries selected from the finite field 

¥q. These matrices naturally generate a sequence of permutation codes: for a permutation code 
conveying 2n g-digits of information, we transmit over the I channel of the sub-channel the 
point in the 2”-PAM constellation that corresponds to the (/-digit sequence that results from 
the linear operation of A^ ' over the first n (/-digits of the 2n information q-digits. This is done 
for each of the £ = 1,..., L sub-channels. Further, the same linear operations are used on the 
last n information g-digits to transmit points from the PAM constellation on the Q channels of 
the L sub-channels. 

We say that this collection of matrices is universally decodable if for any fci,..., such that 

L 

> 0, ^ = 1,..., L and ^^k£>n, (32) 

e=i 

the collection of the first ki,... ,kL rows of the matrices A^”'\ ..., A^^ respectively is full rank, 
i.e., spans the vector space F”. 

Universally decodable matrices (UDMs) provide an operational interpretation to the in¬ 
formation theoretically defined outage condition. The number ki can be interpreted as the 
amount of (/-digits provided by the sub-channel; this depends on the corresponding channel 
amplitude \hi\. If the channel is not in outage, then (El holds. The full rank condition implies 
that a unique codeword can be decoded whenever the channel is not in outage. We formally 
state the implication of this operational interpretation to outage below; the proof is relegated 
to Appendix IdI 

Theorem 5.4. A sequence of UDMs leads to an approximately universal permutation code 
sequence. 
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Observe that the encoding and decoding complexity of the code based on UDMs is simply 
linear in the number of bits n and the number of sub-channels L. The representation of the 
code involves storing the L matrices with a total of Ln^ entries, again a very small number. 

In the rest of this subsection, we focus on explicit construction of UDMs. First, we show 
how UDMs can be easily constructed from maximum-distance separable codes (MDS) (though 
these constructions require a field size that grows with n). In section I5.4.4L we present fixed 
field size constructions for L = 3 and then discuss a recent construction HOI, for arbitrary L. 


5.4.3 Reed-Solomon Codes are Approximately Universal 

In general, some progress on the search for universally decodable matrices can be made by 
strengthening the requirement on the collection of matrices by requiring the collection of any 
n rows from the matrix 




{nf 


(n)‘ 


(n)‘ 


to be full rank. Note that such a collection of matrices is still universally decodable. This 
problem is same as designing a maximum distance separable (MDS) codes with as its 
parity check matrix. The condition universal decodability condition is the same as requiring 
that the minimum distance of the code to be at least n -|- 1. Since A^”) is an n x Ln matrix, 
such a code has length Ln and rate Ln — n. A simple singleton bound shows that then the code 
must be [Ln, Ln — n,n+ 1]^ 

Simple examples of such a code exist and this allows us to explicitly construct the parity 
check matrix A^"'^ For a finite field Fg, a [q + l,k,q — k + 2] extended Reed-Solomon code can 
be explicitly constructed for every k < q + 1 (see Chapter 6.8 of HU for the exact parity check 
matrix). For the extended Reed-Solomon codes, the field size grows with the block-length. In 
fact, the field size is at least Ln — 1. In our setting, n grows as log SNR, thus the field size grows 
like log SNR. As noted in the proof of Theorem 15.4L this still gives an approximately universal 
code. 

Next, we focus on the situation of practical and theoretical interest: constructing UDMs 
with a field size not growing with n. With L = 2, we have already seen an example: 
where In is the n x n identity matrix and D„ is the n x n cross-diagonal matrix with all unit 
entries on the cross diagonal; here the field size q = 2. 


5.4.4 L = 3: Universally Decodable Matrices 

Consider the following collection of binary matrices (i.e., the field size q = 2): {I„,D„,T„}, 
where and are, as before, the n x n identity and cross-diagonal matrix with unit cross 
diagonal entries, respectively. T„ is defined using the recursive definition: 


T2n 


T„ Tn 
0 Tn 


(33) 


® An [n, k, d\ code over F, is a linear, length-n code with codewords and a minimum Hamming distance of 
d. Its parity check matrix is a n — fc x n matrix over F^. Codes for which d = n — k -\-l meet the singleton bound 
(see Chapter 3.2 in ED) and are called MDS codes. These codes are well-studied in coding theory and explicit 
codes like the Reed-Solomon codes are MDS codes. 
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with Ti = [1]. Equivalently, T 2 n = T 2 < 8 )T„, where ( 8 ) denotes the tensor ii'ronec/cer product 
operation between two matrices (cf. Chapter 4.2 in [Hj). For 2™“^ < n < 2"^, we define to 
be the principal sub-matrix of T 2 ’t». We omit our original proof of this result (it is still available 
in an earlier version of this paper ESI), in light of a crisper proof that follows from a more 
general result in ESI; this generalization was motivated by the present construction for L = 3. 

For L = 4, g = 3 computer simulations are used in EH to justify the conjecture that the 
following collection of matrices is universally decodable: where the first two 

matrices are, as before, the n x n identity and cross-diagonal matrix with unit cross diagonal 
entries, respectively. With n = 3, define 


(34) 


For n a power of 3, we define, recursively, = T 3 (g) T„ and Rs^ = R 3 (g) R^, with the 
multiplication operations in the context of the field F 3 . For 3™“^ < re < 3”^, we define T„ and 
R„ to be the principal sub-matrices of Tsm and Rs^, respectively. This conjecture has now 
been verified as a special case of the general result in m- 
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5 . 4.5 A Complete Characterization of UDMs 

Motivated by the results in the previous two subsections, the authors in EHI, have recently 
completely solved the problem of constructing UDMs. They show for any re the condition 
L < q + 1 is both necessary and sufficient. They construct UDMs based on Pascal’s triangle. 
We state their construction (see Proposition 9, EO]), for completeness: 

Theorem 5.5. Let q be a prime power and let L < q + 1. Suppose a is a primitive element 
over¥q. Then the following matrices are UDMs: 

At — Ir 


[Ad 


■1 - -*-725 

A2 = D„, 

UL) ~ (i) for 1 < j,k < n and 3 < i < L, 


where (j) is defined as the natural mapping to prime sub field of¥q of the natural number 

k{k — 1) ■ ■ ■ {k — j + 1) 
j(j-l)---l 


6 The MISO Channel 


The parallel channel allowed us to study approximately universal codes on channels with solely 
multiplexing gain. We now turn to study channels that offer solely diversity gain: the MISO 
and SIMO channels, with multiple transmit (receive) and single receive (transmit) antennas, 
respectively. The SIMO channel can be reduced to a scalar channel by considering a scalar 
sufficient statistic: receive beamformed vector. Therefore, any approximately universal scheme 
for the scalar channel, such as the QAM scheme (see Section IH), will also be approximately 
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universal for the SIMO channel. In this section, we focus on the MISO channel and understand 
properties of approximately universal codes over this channel. 

The scalar output of a MISO channel with nt transmit antennas at time m can be written 
as 

y[m] = h*x[m] + w[m], 

where x[m] is an n* dimensional vector input and h is the n^-dimensional vector of fading gains 
hiS. 

6.1 Characterization of Approximately Universal Codes 

The approximate universality criterion for the MISO channel can be stated as (see Theorem 
EH), for every codeword difference matrix: 

^ 2ii(SNR)+o(logSNR) ’ 

where Ai is the minimum singular value of the codeword difference matrix. 

There is an intuitive explanation for this result: a universal code has to protect itself against 
the worst channel that is not in outage. The condition of no-outage only puts a constraint on 
the norm of the channel vector h but not on its direction. So, the worst channel aligns itself 
to the “weakest direction” of the codeword difference matrix. The corresponding worst-case 
pairwise error probability is governed by the smallest singular value of the codeword difference 
matrix. 

On the other hand, the i.i.d. Rayleigh channel does not prefer any specific direction: thus 
the design criterion tailored to its statistics requires that the average direction be well protected 
and this translates to the determinant criterion. While the two criteria are different, codes with 
large determinant tend to also have a large value for the smallest singular value; the two criteria 
(based on worst-case and average-case) are related in this aspect. 

For the case when nt = 2, the Alamouti scheme EH] converts the MISO channel to a scalar 
channel with gain ||h|| and the total SNR reduced by a factor of 2. Hence, the outage behavior is 
exactly the same as in the original MISO channel, and the Alamouti scheme provides a universal 
conversion of the 2x1 MISO channel to a scalar channel. Any approximately universal scheme 
for the scalar channel, such as a QAM, when used in conjunction with the Alamouti scheme 
will be approximately universal for the MISO channel. 

In the general case when the number of transmit antennas is greater than 2, there is no 
equivalent to the Alamouti scheme. Here we explore one approach to construct approximately 
universal schemes for the general MISO channel: we consider a simple scheme that converts 
the MISO channel into a parallel channel and show that the scheme is approximately universal 
over a restricted class of MISO channel statistics. 

6.2 MISO channel viewed as a Parallel Channel 

Consider the simple scheme of using one antenna at a time to communicate at a rate of R 
bits/symbol on the MISO channel. By using one transmit antenna at a time, we arrive at a 
parallel channel with nt sub-channels and the data rate of communication is R bits/symbol 
per sub-channel. We code over the antennas using a parallel channel code, e.g. a permutation 
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code. Our first result is that this simple scheme is tradeoff optimal for the i.i.d. Rayleigh fading 
MISO channel. 

Can this conversion be approximately universal? To see that this could not be the case, 
consider the following (worst-case) MISO channel model: the channels from all but the first 
transmit antenna are very poor. To make this example concrete, set /i£ = 0, i = 2,... ,nt. The 
tradeoff curve depends on the outage probability (which depends only on the statistics of the 
first channel). Using one transmit antenna at a time is a waste of degrees of freedom: since 
the channels from the all but the first antenna are zero, there is no point in transmitting any 
signal on them. Thus the scheme could not have been tradeoff optimal over a MISO channel 
with such statistics. 

Essentially, using one antenna at a time equates temporal degrees of freedom with spatial 
ones. All temporal degrees of freedom are the same, but the spatial ones need not be the same: 
in the extreme example above, the spatial channels from all but the first transmit antenna 
are zero. Thus, it seems reasonable that when all the spatial channels are symmetric then the 
parallel channel conversion of the MISO channel is tradeoff-optimal. This intuitive argument 
is formalized in the proposition below; the proof is provided in Appendix lEl 


Proposition 6.1. An approximately universal parallel channel code sequence used over the 
antennas of a MISO channel, one antenna at a time, is tradeoff-optimal for the class of MISO 
channels with i.i.d. fading coefficients. Further, the optimal tradeoff curve of the MISO channel 
is given by 

d*(r) = ant{l — r), 0 < r < 1, (36) 


where 


logP (|/i£p < x) 

:= lim-)- 

log X 


= 1,... ,nt. 


(37) 


We have seen that the conversion of the MISO channel into a parallel channel is tradeoff- 
optimal for the i.i.d. Rayleigh fading channel. To get a practical feel for how much loss the 
conversion of the MISO channel into a parallel channel entails with respect to the optimal 
outage performance, we plot the error probabilities of two schemes with the same rate {R = 2 
bits/symbol): uncoded QAMs over the Alamouti scheme and the permutation code in FigureOl 
This performance is plotted in Figure IHl where we see that the conversion of the MISO channel 
into a parallel channel entails a loss of about 1.5 dB in SNR for the same error probability 
performance. This is a fairly small loss and suggests the practical utility of the conversion of 
the MISO channel with larger number of receive antennas to a parallel channel. 


7 The MIMO Channel 

Having studied the construction of approximately universal codes over the parallel and the 
MISO channel, we are now ready to move over the general MIMO channel: we first conclude the 
approximate universality of some recently proposed codes and then explore the approximate 
universality properties of two classical space time coding architectures: D-BLAST and V- 
BLAST. 
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Figure 6 : The error probability of uncoded QAM with the Alamouti scheme and that of a 
permutation code over one antenna at a time for the Rayleigh fading MISO channel with two 
transmit antennas: the permutation code is only about 1.5 dB worse than the Alamouti scheme 
over the plotted error probability range. 


7.1 Approximate Universality of Number-Theoretic Codes 

Some of the recent space time code constructions in the literature have a number-theoretic 
flavor. In particular, a rotated QAM constellation was used to construct a two transmit antenna 
space time code in iziiiiini- For arbitrary nt, cm proposes codes derived from cyclic division 
algebras. Some constructions based on cyclic division algebras are also presented in mnn]. 
All these two codes satisfy the non-vanishing determinant criterion. The authors in mini used 
this property to conclude the tradeoff optimality over the i.i.d. Rayleigh fading channel. In the 
light of our characterization of approximate universality (cf. Theorem KI.ll) , we can conclude 
that all these codes are approximately universal; further more, in the light of the discussion in 
Section EH we can conclude that these codes are approximately universal simultaneously for 
every MIMO channel with nt transmit antennas (nt = 2 for the code in [Tj E] ) and arbitrary 
Ur- To see this formally, we discuss the two transmit antenna code in [7] in some detail. 

The rotated code QAM code in Q spans two symbols and is designed to work over the two 
transmit MIMO channel. The entries of the 2x2 transmit codeword matrix X := [xij] are 


Xu 

■■= QiOl) 

Ul 

, and 

X 21 

:= Q( 02 ) 

U 3 

X 22 _ 


. '“2 . 


a;i 2 


Ui 


Here ui, U 2 , U 3 , U 4 are independent QAMs of size 2^2 each (so the data rate of this scheme is 
R bits/symbol). The rotation matrix Q(0) is 


Q{0) := 


cos 9 — sin 9 
sin 9 cos 9 


With the choice of the angles 9i, 02 equal to 1/2 tan ^ 2 and 1/2 tan ^(1/2) radians respectively, 
Theorem 2 of [7] shows that the determinant of every normalized codeword difference matrix 
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D satisfies 


detDP > 


1 

10 • 2 ^' 


Our discussion so far is summarized in the following formal statement characterizing of the 
performance of this code. 


Proposition 7.1. The code described in (lilSI) . with 6 i = 1/2tan ^2 and 62 = 1/2tan ^(1/2), 
is approximately universal for every MIMO channel with two transmit antennas. 


7.1.1 Discussion 

While the two codes discussed above are explicit and easy to encode, they lack a computation¬ 
ally simple decoding algorithm. In general, it appears hard to design explicit approximately 
universal codes for the MIMO channel with a computationally simple decoding algorithm; it 
still remains an open problem. For the parallel channel we have been able to answer this ques¬ 
tion to a reasonable extent. The difference in the two models arises due to the rotation matrix 
in the SVD decomposition a parallel channel code has to be optimal for a fixed rotation 
matrix (the identity matrix) while a MIMO channel code has to be optimal for every rota¬ 
tion matrix. This difference seems to naturally lead to codes with a number-theoretic flavor: 
they are delicately designed so as to cope with every possible rotation. Such a code with a 
computationally simple decoding algorithm has not yet been found. 

An alternate view point is proposed in m where a lattice based space-time code is con¬ 
structed. The authors show that the structure of these codes resembles random Gaussian codes 
and then conclude the tradeoff optimality of an ensemble of lattice codes for a decoder based on 
a generalized MMSE estimator for the i.i.d. Rayleigh fading channel. A typical code in this en¬ 
semble is very unlikely to be approximately universal. In fact, one of the important conclusions 
of the the authors of m is that their construction shows that maximizing the determinant cri¬ 
terion is not a necessary requirement for achieving the tradeoff for specific fading distributions. 
However, as we see here, maximizing the determinant criterion is a necessary and sufficient 
condition to design robust codes that are tradeoff-optimal for every fading distribution. 

7.2 The V-BLAST Architecture 

The V-BLAST architecture was proposed for high rate communication over the MIMO channel 
m- It splits the data stream into independent streams that are sent over the different transmit 
antennas. It is very clear that V-BLAST is not tradeoff optimal at low rates: the largest 
diversity of any data stream is limited by the number of receive antennas. However, it is also 
clear that the V-BLAST scheme cannot be approximately universal even at high rates: over 
the 2x1 MIMO channel suppose the channel from one of the transmit antennas is zero and the 
other channel is CJ\f{0, 1). Then the diversity obtained by the data stream sent over the first 
transmit antenna for any multiplexing gain is zero whereas the overall channel has a non-zero 
diversity-multiplexing tradeoff. Since the V-BLAST scheme does not code across the transmit 
antennas it takes a hit when the transmit antennas have asymmetric fading statistics. When 
all transmit antennas are statistically similar to one another, V-BLAST indeed turns out to be 
tradeoff optimal at high rates; we explore this aspect in detail in Section |HJ 
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7.3 The D-BLAST Architecture 


The D-BLAST architecture has been proposed to attain high diversity gains over the MIMO 
channel m- The data is split into independent streams that are sent over the MIMO channel 
in a diagonal fashion. The coding scheme can be written as 


0 


P? 


( 2 ) 

Pi 


„(1) 
Pnt — 1 
( 1 ) ( 2 ) 
Pnt Put 


„( 2 ) 

Pnt—1 


(T-nt+l) 

Pi 


(39) 


where = Pi\ ... ,pnt are the independent data streams. 

It is well known that the D-BLAST architecture with MMSE-SIC receiver preserves mutual 
information over any deterministic MIMO channel with Gaussian inputs; thus it converts a 
MIMO channel into an equivalent parallel channel (a tutorial description of this conversion is 
described in Chapter 8.5 of [5]). Therefore an approximately universal code over the parallel 
channel, such as the permutation code, when used as the streams of the D-BLAST architecture 
for the MIMO channel will be approximately universal for the MIMO channel. This approach 
of converting the MIMO channel into a parallel channel has also been used by Matache and 
Wesel in |3]. 

Alternatively, one can see its approximate universality by explicitly verifying that it satisfies 
the condition in Q for rit = rir- The product of singular values of the codeword difference matrix 
for (13911 turns out to be lower bounded by the product distance of the permutation code. Thus, 

ifp(0 

is a permutation code that is approximately universal for the parallel channel, then the 
D-BLAST scheme (1391) is approximately universal for the MIMO channel (see and compare (1^ 
and (HJ). 

A potential drawback is the initialization loss due to the zero padding in dSi) which reduces 
the effective rate. For a 2 x 2 channel with block-length three, a rate of R bits/stream corre¬ 
sponds to a rate of 2R/3 bits/symbol on the MIMO channel. In general, the actual tradeoff 
curve achieved by this scheme is 


where r is the multiplexing gain per symbol. For the block length T large, D-BLAST approaches 
approximate universality. For finite block-length, this scheme is strictly sub-optimal. The 
precise characterization for approximate universality also implies that this performance can not 
be universally improved upon using a better decoding strategy (than MMSE and successive 
interference cancelation). In Section 1^1 we see that the performance can indeed be improved 
upon for a certain restricted class of fading distributions using a better decoding strategy. 


8 The V-BLAST Architecture 

The V-BLAST architecture transmits independent data streams over the transmit antennas. 
This is closely related to how a multiple access channel is operated, the tradeoff performance of 
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which under i.i.d. Rayleigh fading is studied (using random Gaussian codes) in In this 

section, we study the performance of simple modulation schemes over the V-BLAST architec¬ 
ture: in particular, QAM constellations. While we have seen that the V-BLAST architecture 
can never be approximately universal, it still performs very well for an interesting restricted 
class of channels. 


8.1 Tradeoff Optimality over Rayleigh Fading Channels 

Consider operating the V-BLAST architecture over an nt x rir i.i.d. Rayleigh fading channel: 
we transmit independent data streams over each of the nt antennas; each data stream is trans¬ 
mitted un-coded using a QAM constellation (with points at each time symbol). This 

scheme corresponds to a total data rate of r log SNR bits/symbol over the MIMO channel. Our 
main result is the precise characterization of the tradeoff performance; the proof is available in 
Appendix O 

Proposition 8.1. Uncoded independent QAMs of size SNR’'/”'* points over the antennas of an 
nt X nr i.i.d. Rayleigh fading MIMO channel are proteeted by a diversity gain, d{r), where 

Thj'V 

dir) = nr -if > nt (41) 

nt 

> nr — r nr < nt. (42) 


Several interesting observations follow from this result. 


1. Apart from the fact that the channel can be in outage, there is an additional error event in 
the V-BLAST architecture: the presence of the other simultaneously transmitted streams 
impacts the reliable reception of any particular data stream. However, the reliability 
performance represented in dm is as if the other streams didn’t exist at all. This suggests 
that the typical way error occurs is not due to the inter-stream interference but because 
of the channel being in outage. 

2. With nt = nr = n, the diversity gain of uncoded QAMs is equal to n — r] this matches the 
optimal diversity gain characterized in ^ for large enough r (> n — 1). This observation 
is graphically illustrated in Figure [71 

3. In a multiple access setting with 

• nt users with one transmit antenna each, 

• a symmetric multiplexing gain of r/nt per user, 

• nr > nt receive antennas, 

the diversity-multiplexing tradeoff is given by EHl 

nrr 

nr -. 

nt 

Therefore this simple scheme is tradeoff-optimal. 


(43) 


4. With nt ^ nr, the performance of uncoded QAMs is never equal to the optimal diversity 
gain of the channel. 
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Spatial multiplexing gain: r = R/log{SNR) 


Figure 7: The i.i.d. Rayleigh fading channel with nt = Ur = n. 


Rayleigh fading is a physically relevant fading model and we have seen the tradeoff optimality 
at high rates of plain uncoded QAMs using the V-BLAST architecture. We can conclude the 
robustness of this performance if it continues to hold for a wider class of fading distributions; 
this is the focus of the next section. 


8.2 Tradeoff Optimality over Isotropic Fading Channels 


The key property of a fading distribution determining the diversity performance is the near zero 
behavior of its singular values. In particular, denoting (/)i,..., (^min(nr,nt) to be the increasingly 
ordered squared singular values of H, suppose 


T ^ Cl) • • • ) 4^min{nr,nt) — Cmin(nr,nt)} 


^fcl+1 


^min(nr,nt) 


(44) 


for ei < • • • < ^min{nr,nt)' here our notation /(e^, . . . > Cmin(nr,nt)) 5 (^ 1 ) • • • ) ^rain{nr,nt)) in 
the sense of 

log/(ei,... 

5 ^min(nr,'n.t)) -i /'Ar\ 

lim lim • • • iim ----^^ = 1. (45) 

^l—>0e2—>0 £min(rar,rat)“^^ log^fCl) • • • : Cmin(nr,nt)) 

We also assume that all the singular values have an exponential tail, i.e., , for there exists an e 
such that for large enough x, 

P {(/)£> x} < 6“*^^ V i. (46) 


For a given near zero behavior of singular values, the tradeoff curve can be explicitly determined. 
We compute it for the case when fejS are increasingly ordered (as is the case for i.i.d. Rayleigh 
fading). 


Theorem 8.1. If ki < k 2 <■■■ < fcmin(nr,nt)) then the tradeoff curve is piecewise linear with 
min(n,.,nt) segments and the segment (i.e., s < r < s + 1) is given by: 

min(nr,nt) 

{^min{nr,nt)—s + l)(s + l-r)+ ^ (ki + l) 

£=mm{nr,nt)—s-\-l 
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Furthermore, random Gaussian codes with block-length T > k^\i,(^nr,nt)-s + 1 achieve this 
performance. 

Proof. See ApDendix IH.ll for the outage curve calculation. The proof of achievability for random 
Gaussian codes is a simple generalization of achievability proof in PP and we omit it here. □ 

The key property of the i.i.d. Rayleigh fading channel used in the calculation of the per¬ 
formance of uncoded V-BLAST transmission is the rotational symmetry of its statistics. We 
can thus generalize this calculation and characterize the performance of uncoded V-BLAST 
transmission over isotropic distributions on the n x n MIMO channel H: 

HQ has the same distribution as H for every unitary matrix Q. (47) 

If the ordered singular values of the n x n MIMO channel H decay slower than the corre¬ 
sponding decay rate of ordered singular values of H with i.i.d. Rayleigh fading, then we can 
extend our earlier observation of tradeoff optimality of the transmission of uncoded QAMs over 
the V-BLAST architecture at multiplexing gains r > n — 1 on the i.i.d. Rayleigh fading channel. 
We make this precise in the following proposition, delegating the proof to Appendix IhI 

Proposition 8.2. Consider n x n isotropic MIMO channels with the polynomial decay rates 
of its squared singular values as defined in (1111) . The uncoded QAM transmission over the 
V-BLAST architecture at multiplexing rates r > n — 1 is tradeoff optimal for every isotropic 
MIMO channel satisfying 


ki > (2i — 2) -I- /ci i = 2,... ,n, 

ki < 0 . 

9 The D-BLAST Architecture 

We have seen (cf. Section 17.d|) that the D-BLAST architecture with approximately universal 
parallel channel codes over its independent constituent data streams approaches approximately 
universality for large block length (cf. (I4()j) i. For any hnite block length, the architecture is 
strictly not approximately universal. However, we will see in this section that by restricting the 
class of MIMO channels over which we demand universality, the performance of the D-BLAST 
architecture can be significantly improved. In particular, our focus throughout this section is 
with isotropic MIMO channels. We characterize the diversity performance of the D-BLAST 
architecture with exactly two data streams; our main result is the observation of a restricted 
universality result for channels with 2 receive antennas. 

The i.i.d. Rayleigh fading MIMO channel is also isotropic and we state our results first in 
this context; the calculations are relatively simple and shed insight as to why we can expect 
robustness when generalized to arbitrary isotropic channel distributions. 

9.1 Tradeoff Optimality over Rayleigh Fading Channels 

Consider the nt x 2 i.i.d. Rayleigh fading MIMO channel; the tradeoff curve is composed of two 
linear segments, as illustrated in Figure |HJ 
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Spatial multiplexing gain: r = R/ log{SNR) 


Figure 8: The tradeoff behavior for the n* x 2 i.i.d. Rayleigh fading channel. 


9.1.1 D-BLAST and the First Segment 

Consider the D-BLAST architecture with only two independent data streams: 


0 • • • 0 Pnt Qnt 


: 0 

0 p2 : 

Pi qi 0 • • • 0 


(48) 


here [pi, ■ ■ ■ ,Pnt] and [qi,...,qnt] are unit block-length approximately universal codes for a 
parallel channel with rit sub-channels. Suppose both these codes have a data rate of 


(nt + l)r 
2nt 


log SNR 


bits per sub-channel. 


(49) 


Since the overall architecture is composed of two data streams and the transmission lasts nt + ^- 
time symbols long, the overall data rate of the architecture is r log SNR bits/symbol. Our main 
result is a precise characterization of the diversity performance under joint ML decoding of the 
streams; the proof is available in Appendix IFl 


Proposition 9.1. The D-BLAST architecture in (1481) with approximately universal parallel 
channel codes as its two data streams operated at a total multiplexing gain of r over the i. i. d. 
Rayleigh fading nt x Ur MIMO channel with Ur >2 sees a diversity gain equal to 


Ur 



nt + l \ 
- r . 

2 J 


(50) 


A couple of observations follow: 

1. If we set Ur = 2, the diversity performance in (CT is equal to 2nt — {nt + 1) r] this overlaps 
with the optimal tradeoff curve of the channel for small enough multiplexing gains, i.e., 
r < 1, thus achieving the first segment for the x 2 i.i.d. Rayleigh fading channel (see 
Figure 0. 
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Spatial multiplexing gain: r = R/ log{SNR) 


Figure 9: Diversity performance of the D-BLAST architecture. 


2. From the perspective of one of the streams in the D-BLAST architecture, the best diversity 
performance is obtained if the other stream didn’t exist at all (or was decoded correctly 
and thus canceled exactly). Suppose this is the case: then each data stream sees a 
parallel channel with nt scalar sub-channels, each of whose squared amplitudes are i.i.d. 
with distribution X2nr- optimal tradeoff curve for this parallel channel with a data 
rate of {nt -|- 1) /2 bits/symbol (cf. is 


Ur 



nt + l \ 
-r . 

2 J 


(51) 


The diversity performance of any data stream with the other stream being perfectly can¬ 
celed cannot be any more than the gain in m- However, from the claim in Proposition^^ 
(cf. pIlL we observe that this upper bound is exactly equal to the diversity gain achieved 
even when there is inter-stream interference. There we conclude: 


Under the joint ML decoder, inter-stream interference is not the typical error 
event. 


We study the joint ML decoder in some detail in the next section. 

3. Finally, we observe that we crucially used the symmetry between the two streams in the 
above argument. With more than two streams, the middle streams see more interference 
than the outer two streams and an extension to this situation is not natural. 


9.1.2 D-BLAST and ML Decoding 

In this section, we discuss the ML decoding of the two data streams in the D-BLAST architec¬ 
ture in some detail. To make our discussions simple and concrete we focus on the simple case 
of nt = 2; the received signal spans three time symbols and can be written as 


[yiy2y3] 


[hih2] 


0 P2 q2 
Pi qi 0 


-I- [■W1W2W3]. 
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The two data streams \pi,P 2 ] and [qi,q 2 ] are unit block-length approximately universal codes 
for a parallel channel with 2 sub-channels and independent of each other. For concreteness, 
suppose Pi {qi) and p 2 ( 92 ) are points from a QAM constellation and correspond to bit reversal 
with alternative bits flipped of each other (cf. Section I5.T5I) . The ML decoder makes a joint 
decision on both these codes using the three received vectors yi,y 2 ,y 3 - However, due to the 
specific structure of the zeros in the D-BLAST architecture, the joint ML decoder can be broken 
down algorithmically into three separate steps: 

1. We observe that the received vector at the first time symbol yi gives information only 
about the the QAM symbol pi: 


yi=pih2-kwi. (52) 

In particular, yi specifies exactly the most significant bits of the bit representation of the 
QAM point pi (cf. Section ESI)- More specifically, the number of MSBs of pi that can 
be deduced from yi is with high probability equal to [log (|h 2 pSNR)J; further more, the 
information about the remaining bits of pi depends on the noise wi that is independent 
of the received signals at the other two time symbols. Since the QAM points pi and 
P 2 correspond to bit reversals (with alternate bits flipped) of each other, we have also 
deduced the least significant bits of [log (|h 2 pSNR)J of p 2 - 

2. The scenario at the third time symbol is identical to that at the first time symbol except 
that Pi is replaced by q 2 and p 2 by qi. In particular, we can deduce [log (|hipSNR)J 
MSBs of q 2 (and the [log (|hipSNR)J LSBs of gi) from ys; further more, the information 
about the remaining bits of 52 (and hence qi) depends on the noise vector W 3 that is 
independent of the received vector at the first two time symbols. 

3. We are now ready to focus on the received vector at the second time symbol: 


y2 = P2^1 + gih2 + W2- (53) 

Here we know some of the LSBs of both p 2 and qi (due to processing of the received vector 
at the first and third time symbols, respectively); this reduces the randomness in p 2 and 
qi to another sparser QAM which is a subset of the original QAM from which they were 
drawn. We see from ( 1 ^ is exactly the output of a 2 x 2 MIMO channel with uncoded 
QAMs transmitted over the two transmit antennas, i.e., uncoded QAM transmission over 
the V-BLAST architecture. Thus, the ML decoding of the two streams of the D-BLAST 
architecture reduces to that of a decoding uncoded QAM transmission over the V-BLAST 
architecture. 

9.1.3 A Time-Space Code and the Second Segment 

While we have seen the tradeoff optimality of the D-BLAST architecture in achieving the first 
segment of the rit x 2 i.i.d. Rayleigh fading channel, there is a simple transformation of this 
architecture that achieves the second segment of the same channel. The key is to consider a 
time-space version of the space-time D-BLAST architecture: replace the transmit symbol at 
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time symbol m over the transmit antenna k by the transmit symbol at time symbol k and 
transmit antenna m. In particular, the time-space version of the space-time code in is 


0 

... 0 

Pi 


P2 

91 

0 

92 

0 

Pm 

Qm 

0 ••• 

0 


(54) 


It is meant to be used over a channel with nt + l transmit antennas and spans rit time symbols 
long; observe that the original code in is meant to be used over a channel with n* trans¬ 
mit antennas and spans rit + 1 time symbols long. Suppose that [pi,... ,Pnt] and [gi,..., 
independent unit block-length approximately universal codes for the parallel channel at rate 
0.5log SNR bits per sub-channel; this corresponds to the overall code in JMI) to have a total 
multiplexing rate of r bits/symbol. Our main result is a precise characterization of the diver¬ 
sity performance of this space-time code over the i.i.d. Rayleigh fading channel; the proof is 
available in Add endix lF.il 


Proposition 9.2. The diversity gain of joint ML decoding the data streams of the time-space 
code in at a total multiplexing rate of r bits/symbol over the {nt -|- 1) x i.i.d. Rayleigh 
fading MIMO channel with Ur >2 is equal to 


ntUr 

2 


(2-r). 


(55) 


Setting rij. = 2, we see that the diversity gain in (|53|) is equal to 2nt — ntr which overlaps 
with the optimal tradeoff curve for that channel for large enough multiplexing gains, i.e., r > 1; 
in particular, this achieves the second segment of the tradeoff curve (see Figure ITnll . 



Spatial multiplexing gain: r = R/ \og{SNR) 


Figure 10: D-BLAST curves vs the optimal tradeoff curve 
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9.1.4 Tradeoff Optimality over Isotropic Channels 

We demonstrate the robustness of the performance results a time-space code for the i.i.d. 
Rayleigh fading channel by generalizing them to the class of isotropic fading distributions: in 
particular, we are interested in MIMO channel distributions which satisfy the property in (EH). 
Further recall the definition of the polynomial decay rates of the squared singular values of 
the n X n MIMO channel in EH). The proofs of the results in this section are available in 
Appendix IHl 

Our result is the restricted approximate universality of the time-space version of the D- 
BLAST architecture with two data streams in achieving the second segment of the tradeoff 
curve; this generalizes the result in Proposition 19.21 The proof of this result is available in 
Appendix IhI 

Theorem 9.1. The diversity gain of joint ML decoding the data streams of the time-space code 
in EH at a total multiplexing rate of r bits/symbol over any isotropic nt+i x 2 MIMO channel 
achieves the second segment of its tradeoff curve, provided 

k2- h > 2 , 

ki < 0 . 


10 Conclusion 

We have presented a precise characterization of universally-tradeoff optimal codes for the MIMO 
channel. We also presented explicit codes for the parallel channel that are simple to encode and 
decode. These codes, along with the general construction in m, completely solves the code 
design problem for the parallel channel. For the MIMO channel, we suggest using the D-BLAST 
architecture to reduce it to a parallel channel and using codes designed for the parallel channel. 
This approach is reasonable when the block-length is large, since in this case the initialization 
overhead in D-BLAST is insignificant. While, finite block length approximately universal codes 
for the MIMO channel have been constructed, they are not known to be simple to decode; 
construction of simple codes for the MIMO channel remains an open problem. 

Alternative to approximately universal codes for MIMO channel, we have seen the existence 
of simple codes for the MIMO channel that are approximately universal for a restricted class 
of fading distributions. Our construction has been restricted for specific number of antenna 
elements; a generalization of this construction is also an interesting future research direction. 

A Converse for Approximate Universality 

We want to show that if a coding scheme does not satisfy the universal code design criterion, 
then there exists a fading distribution such that the coding scheme is not tradeoff optimal. In 
the high SNR scaling of PP, a coding scheme is defined by a discrete sequence of codes (7(5NR) 
with rate r log SNR. If this sequence does not satisfy the approximate universality criterion, 
then there exists a subsequence of (7(5NR) such that for every code in the sub-sequence there 
exists a codeword pair such that it does not satisfy the universal criterion. For proving the 
existence of a fading distribution such that the original sequence is not tradeoff optimal, it is 
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enough to find a fading distribution for which this subsequence of codes is not tradeoff-optimal. 
Therefore we assume that for every code in the sequence we can find a codeword pair that does 
not satisfy the universal criterion. 

A brief note regarding our notation; we use the symbols = (>,<) to denote exponential 
equality (inequality), i.e., 


/(SNR) = SNR^ 


lim 

SNR^oo 


log/(SNR) 
log SNR 


b. 


A.l Proof of Theorem 15.11 


Here we focus on the necessity of the condition for approximate universality for the MIMO 
channel. If a sequence of codes is not approximately universal, we show that there exists an 
i.i.d. distribution on ijjis such that this sequence of codes is not tradeoff optimal. 

For codewords X/i and X^, the pairwise error conditioned on a channel realization, H, can 
be written as (cf. Q): 


Pe (x^^XbIh) = g 


SNR^gnK,nO|^^|2|^^|2 


The approximate universality condition can then be written as: 

mm{nr,nt) 

min SNR V > 1. 

Efci l°g(l+l’/’d2SNR)>rlogSNR ^ 

Thus, if a sequence of codes does not satisfy the universal criterion then there exists a sequence 
of codeword pair differences, D(SNR), and a corresponding realization H“(SNR) such that 

mm{nr,nt) 

SNR ^ |A£(SNR)|2|V;^“(SNR)|2 < ( 50 ) 

£=l 

for some positive e, where H“(SNR) satisfies 

min(nr,nt) 

log(l +|V'£“(SNR)|2SNR) =rlogSNR. (57) 

£=l 

Now define H^(SNR) as 

|/;f^(SNR)|2 = |V^“(SNR)|2.2reiogSNR^ / = 1 ,..., L. 

Then using (EEl) and ISZI), H^(SNR) satisfies 

L 

SNR^|A^(SNR)| 2 |/;^^(SNR )|2 < 1 and 

e=i 
L 

^log(^l + |V'^^(SNR)|2SNR) > r(l + e)logSNR 

£=l 


(58) 

(59) 
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Now, consider the i.i.d. fading distribution on H such that: 


p{lV^£p<-| = 4 V i = (60) 

I ^ J Xe 

The diversity for the code-sequence can then be upper bounded using the following sequence of 
steps: 

1. The pairwise error for the codeword difference Z)(SNR) can be lower bounded by a con¬ 
stant, (5(-v/0.5), for a range of channels such that (see ll^ i: 

{H:|^,|2<|V:,'(SNR)|2, ,l]. 

Furthermore, because of the power constraint on the input we can assume that 

|V>^(SNR)p > V i = ,mm{nr,nt) (61) 

If this is not true, we can increase '0£(SNR) to min(n ^nf) sM (1601) and (t^ still 

hold. 

2. Hence the probability of error can be lower bounded by 

_ mm{nr,nt) 

Q{V^) n p{|V’£p< IV’£'(snr)|2} 

P,(C(SNR)) > --. (62) 

Writing 


|V’£^SNR)|2 ^ sNR-“^ 

the probability of error expression (1621) can be written as (also see (1601) 1: 

Pe(C(SNR)) > SNR~(?S^“«+^), (63) 

where a^s satisfy (see jsni) and (|6T|)1: 

Ed — CK^)"*" > ''’(1 + e) and < 1. 
e 


Therefore 


aj < min {rir, nt) — r{l + e). 
l 

Then the probability of error for (7(5NR) is lower bounded by (see (1601) and (El): 

Pe((7(SNR)) > 

= SNR^(e 
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Thus, the diversity of the sequence of codes is upper bounded by: 

2 

— (min (n^, n^) — r) — r. (64) 

The outage curve on the other hand is given by^: 

- (min {nr,nt) — r). 

Thus, comparing with (ini), this sequence of codes is not tradeoff optimal and hence not ap¬ 
proximately universal. 


B Proof of Theorem 15.21 


Consider a parallel slow fading channel with L sub-channels. A permutation code over this 
channel can be rewritten as 


C = 


SNR 


{Q,f2iq),--jL{q)) k e Qz >, 


where 


^ f, sNR^/2 

Qz = {{a + lb) :--- <a,b< —-— 


(65) 


is the integer-QAM with SNR’" points, and / 2 ,...,/l are permutations of Qz. We define the 
normalized product distance between two codewords as 


'^d{qi,q2) = 


qi - TT \ fkiqi) - fk{q 2 )\" 


SNR^ 


n 

k=2 


SNR" 


The condition for approximate universality, (HI, on the other hand, can be written as 

1 


T^d{qi,q2) > 


SNR" 


Vgi / q2- 


( 66 ) 


(67) 


The number of permutation codes with SNR" points is given by 

((SNR")!)^-k 


We now prove existence of a permutation code in this ensemble such that (|H7|) is satisfied. 
We average of the inverse of product distance over all such codes under the uniform measure 
(all codes have the same probability). The intuition behind averaging the inverse of product 
distance is to capture the codeword differences that have small product distance, which is the 
event of interest. 


E 


1 


1 SNR" 1^ SNR" 

(SNR"!)i-iSNR2" \qi-q2\^ I^(^i) "’ 

qi,q2^qi 


proof for this result can be seen from Appendix m Equation (Ei, with nt replaced by min (n,.,nt) and 
ntr replaced by r and a replaced by -. 
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1 


1 


((SNR''-2)!)-^-iSNR^^ 
(SNRn)^-iSNR2'’ 


E 

91,927^91, 

k=2,-,L 


n 


91 -^2p \q\-ql?' 


The second equality is obtained by considering all permutations /^’s that map qi to < 7 ^ and 52 
to gf; the number of such permutations is ((SNR'" — 2)!)^“^. Therefore, 


E 


■ 1 ■ 

1 

y ^ 

.TTd. 

" SNR^’- 

y 191 - 92 P 

_qi,q2¥^qi 


n L 


Because of the symmetry of the QAM, the average inverse product distance can be further 
upper bounded as 


E 


1 


< 


1 


SNR 


Lr 


SNR^ y 

„ynl9i| 


n L 


917^0 


( 68 ) 


n L 


E 

giT^o 


l9i| 


The summation inside the parantheses in can be upper bounded by (logSNR)^. This 
implies that the expectation can be upper bounded by 


E 


1 


< 1 . 


We conclude that there exists at least one permutation code C“ with the average inverse product 
distance less than 1 . We now use this code C“ with good average behavior to construct a code 
that has a good worst-case behavior. For C“, 


1 


E 


1 




< 1 . 


Therefore, 


SNR^ 


y ff(gi) < SNRQ 


gisQz 


where 


9(91) = y 


iJ26Qz,927^91 


71 ^( 91 , 92 )’ 


Thus, at least half of the qis have g((?i)<SNR^. By expurgating at most half the codewords, 
we can construct a code such that: 


5(91) < SNR’- Vgi. 


(69) 
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This implies that for every q 2 ^ Qi, 

T^d (91,92) 

this is precisely the criterion for approximate universality dSH). Finally, expurgating at most of 
half of the codeword reduces the rate of the code by at most one and hence does no change the 
multiplexing gain. Thus, there exist approximately universal permutation codes. 

B.l Product distance distribution 

A statement much more stronger that that made about the code constructed in Section m 
The result below characterizes the behavior of the product distance vr^, cf. (Ibhl) . (rather than 
just a lower bound, which is what was required for approximate universality), and hence can 
be thought of as a weight distribution result for the product distance. 

Theorem B.l. Consider a parallel slow fading channel with L sub-channels. There exists a 
permutation code with SNR^ points over this channel such that the number of codeword pairs 
that have a product distance less than SNR^”'’ is 0(SNR^+^), fork in [0,r]. 

Proof. We start with the code constructed above that satisfies p|): then for each qi, the 
number of codewords which are at a product distance less than SNR^”'’ is 0(SNR^), for k in 
[0,r] (otherwise such a code will not satisfy (IHUjl h Considering all possible values of qi, the 
number of codeword difference that have product distance less than SNR^”'’ is 0(SNR^"'“^), for 
k in [0, r]. 

□ 


C Proof of Theorem 15.31 

Let the binary representation of integers ai and 02 be: 

~ ^R/2 ■ ■ ■ ^ 1 , 

Let k be the largest integer such that 6], bf,. Then without any loss of generality we can 
assume that = 1 and = 0. We also write 

hi = b] = bf V k -\- 1 < i < R/2, 

for notational convenience as well as to emphasize that the largest R/2 — k bits are the same. 
Now, let I be the smallest integer such that > b‘j,_i. Note that this implies that 

= 0 b\_i = 1 V — 1, 

which is similar to the codeword pair that was the counter example given for the fact that 
simple bit-reversal is not universal (see Section Fb.d.hi) . Here we essentially prove that such pairs 
are the only reason that the simple bit reversal is not approximately universal and bit reversal 
with alternated bit flipping can tackle this problem. We consider the following subcases: 
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• If no such I exists, Then OjS can be written as: 


— ^_R /2 ■ ■ ■ &fc+ilO • • • 0 

®2 = ^_R /2 ■ ■ ■ ^fc+iOl • • • 1 

and B{ai)s can be written as: 

B{ai) = 101 -•• 

B{a2) = 010 -•• 

Thus B{ai) — B{a 2 ) is lower bounded by hence (|!TT]) is satisfied. 

• I >2: Then OjS can be written as: 

= bji/2-■-bk+iW-■ ■0bl_ibl_i_i-■-bl 

«2 = 6^/2 • • • 6 fc + l 01 • • • • • • 6 ? 

then, the difference ai — 02 can be lower bounded by and B{ai)s can be written 

as: 

B{ai) = b\bl---bl_i_-^bl_ill0l---0bk+i---bR/2 
B[a2) = b\bl---bl_i_^bl_iQ{)l{)---lbk+i---bR/2 

Then the difference \B{ai) — B{a 2 )\ is lower bounded by (here we have 

assumed that bk-i is not flipped, i.e., k — I is even; if A: — Hs odd, then same argument 
hold with 02 and ai reversed). Thus, the product distance is lower bounded by 
(which is the one in (lillll l. 

• If ^ = 1: then OjS can be written as: 


— ^i ?/2 • • • 

«2 = bji/2 ■ ■ ■ bk+iObl_i ■ ■ - bl 

then the difference ai — 02 can be lower bounded by 2^“^ (since b\_^ > b\_^). The B{ai)s 
can be written as: 


B{ai) = b\b\-■ ■b\_^lbk+ibk+ 2 -■ ■bR /2 
- 8 ( 02 ) = blb\ • • • hl_^Qbk+ibk +2 ■ ■ ■ bR/2 

and the difference |i?(ai) — B{a 2 )\ is lower bounded by 2 ^/^“^ (here we have assumed 
that bk-i is flipped, i.e., k is even; same is true if k is odd). Thus, the product distance 
is lower bounded by j-^rJ 2 - 
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D Proof of Theorem 15.4 


We again consider the I and Q channels separately. Then we want to define L—\ permutations of 
the PAM such that the corresponding permutation code is approximately universal. We consider 
the g-digit representation of the PAM. For a PAM with g” points and number it from left to 
right by 0 to g” — ! (in term of the rate R, n behaves like ) • For showing that a universally 

decodable system satishes the product distance criterion, we have to resort to irregularly spaced 
PAMs. For every least significant q-bit change, we put a gap of Similar to the 

two sub-channel case, using this construction we prove that any universally decodable scheme 
satisfies the condition for approximate universality: consider any two codewords; suppose for 
the sub-channel their MSBs are the same and MSB is different. By construction 

of the irregularly spaced QAM, the normalized (by q^) separation in the coordinate is lower 
bounded by 




The universal decodability condition implies that if ki > n, then there exists a unique 
codeword corresponding to the MSBs. Therefore, the k^s must satisfy 


< n. 


Thus, the product distance can be lower bounded by 


|did2---dL|2/^ > 


l/L 




= [Q 


l/L 


> 


^ -2n/L 

jL 

g22^ 

1 


(70) 


implying that the code satisfies the approximate universality condition (PIJ. For a PAM of size 
g”, the (normalized) increase in size is given by 


gq^ ^ ” (number of LS q-bit changes) = gg"* "(g"' ™'), 

m=l m=l 

= gn. 


(71) 


In the high SNR scaling, 


g^ = SNR’' ^ gn = logSNR. 
Thus the extra spacing does not affect the multiplexing gain. 
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We also note the Theorem E31 is true even if the field size q is growing like log SNR. Note 
that if q grew like a polynomial in SNR, i.e., like SNR'", then we can no longer ignore q in (f7n|) 
and such a code then will not be approximately universal. We also have to show that the power 
gain because of the gaps still increases slowly enough so as to not affect the multiplexing gain. 
For a PAM of size q"^, the increase in size is, cf. (EH, 

gn < SNR*^ V e > 0. 

Therefore, the extra spacing does not affect the diversity-multiplex tradeoff. 


E Proof of Propositions 16.11 


We use an approximately universal parallel channel code, (e.g. a permutation code [pi, • • • ,Pnt] 
with total rate Rrit) over the MISO channel in a diagonal fashion: 


Pi 0 0 

0 0 

0 0 Pnt 


(72) 


We prove that scheme El is tradeoff optimal for MISO channel with i.i.d. fading coefficients. 
Since it operationally converts the MISO channel into a parallel channel, we only need to 
match the outage probabilities of the MISO channel and the corresponding parallel channel. 
The outage probability of the MISO channel is given by. 


nt 


log I + ^lhipSNR <rlogSNR 


2 = 1 


For the equivalent parallel channel, the outage probability is given by 

1 


P< ^log(l + IhipSNR) < riirlogSNR 


. 2=1 




(73) 


(74) 


The near zero behavior of sum of \hi\‘^s can be upper and lower bounded as: 




nt 

< P 


nt I 

< X > < (P{|hip < x]Y' . 


Since the upper and lower bound have the same decay rate, the probability of outage of the 
MISO channel, (TTHl) . has a decay rate of 


V SNR ) 


(75) 


Thus, the outage curve of the MISO channel with i.i.d. fading coefficients with the a denoting 
the decay rate of | hi p near zero is 


dontir) = ant{l-r). 
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The second outage probability, is somewhat more involved. Define ai by 

1.12 SNR-- 

' SNR ■ 

In this notation, the outage condition for the parallel channel can be written as 

^ a* < ntr. 


(76) 


Since the sub-channels are independent, the outage probability (cf. (TTUl l has the decay rate 

/SNR“'\“ 


max 

Ql,-" ^OLni 


n 

i=l 


SNR 


(77) 


where the maximization is under the constraint in Thus, the decay rate of the outage 

probability expression in (|77|) is 

/Snr^\^‘“ 

(w) ’ 

the same as that in m-, this completes the proof. 


F Proof of Proposition 19.11 


We prove that the diversity obtained by the code in is Ur {nt — f), where flog SNR is the 
rate of codes [pi, • • • ,Pnt] and [gi, • • • , qnt]- 

The pairwise probability of error, averaged over the Rayleigh fading channel with receive 
antennas is given by |H1 

P(Xo^Xi) < dg^(i+(Xo-Xi)(Xo-Xi)t)"’-- 

The difference codeword pair can be written as: 


0 • • • 0 dUnt 


0 dl : 

d\ 0 • • • 0 


(79) 


where = [d\, • • • , and d'^ = [d^, • • • , d^J are the codeword difference for a permutation 
code. 


Expanding (Xq — Xi)(Xo — Xi)"!^ in terms of the streams, we get: 

det (l + (Xo - Xi)(Xo - Xi)t) > \d\dl ■■■dlf + \dldl • • • dlf 
The probability of error can be upper bounded using the union bound: 


Pe < 


^ i_ 

x^x {\dldl---dl,\^ + \d\dl---dt?r- 


42 












This upper bound can be broken into two summations: one corresponding to where both the 
streams are different and the other summation where one of the streams is the same. Suppose 
the same code is used for both the streams; now the upper bound can be simplified: 


IPe < 


E 




+ 


E 




1 

+ \d1d2 ■ ■ ■ dntl)^'" 


The arithmetic mean-geometric mean inequality for the term inside the first summation yields 

1 1 


*e < 


E 




+ 


E 




(sNR-’_gj«... 


I 

Unt \ 


+ 


SNR’ 


E 






Now, we use the product distance distribution result in Appendix IB.ll to separately bound the 
two summations on the RHS. The result says that the number of codeword differences pairs 
with \d^ - ■ ■ dnt P less than is 


SNR^ 


r+/c 


for k in [0, r]. Using this result, the first term can be upper bounded as: 

2 

1 1 


?E 


SNR^^O 


< 


1 cMofc + fSNR ’’’'("-'=)/2 

max -=SNR'‘+’^- 


fce[o,f] SNR*^ 




= SNRa-/^)‘i^' 

\ke[o,f] 


for Ur > 2. The second term corresponds to the error when one of the streams is decoded 
correctly and can be directly verified to be of the correct order. Alternatively, 


2 1 


< 


max - -^SNR 
fce[o,r] SNR’^ 


k-\-r 


SNR»^r(r-fc) 

SNR"’-”* 


max 

fce[o,f] 


5,^Rfc(l-nU 


SNR”’-’' 

SNR”’-’"*’ 




Thus, combining the two upper bounds, for Ur > 2 there exists a code such that the diversity 
gain is 

nr{nt - f). 

Taking f = proves Proposition 19.11 
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F.l Proof of Proposition 19.21 

For the (nt + 1) x 2 channel, we transposed the code in (1481) which was used for achieving the 
first segment nt x 2 channel. The probability of error can be calculated using a union bound 
calculation. The pairwise probability of error is given by 

1/det (l + (Xo - Xi)(Xo - XOt)"”. 

Since 

det(l + (Xo-Xi)(Xo-Xi)t) = det(l + (Xo-Xi)t(Xo-Xi)) , 

the union bound calculation for calculating the probability of error is exactly the same as same 
as (|1H|) case. Therefore the diversity obtained by this scheme is given by UrUt — Urf. But in 
this case we are coding over a block-length of nt, thus the actual tradeoff curve is nrnt{2 — r)/2, 
where r log SNR is the per symbol rate of the channel. 


G Proof of Proposition 18.11 

The scheme of sending nt QAM constellations can be written as 


q = 


SNR 


SNR^ 


— ik, • • • ,*2nt )\ij G Pz V j = 1,... ,2nt > , 


where Pz is the integer PAM constellation with SNR^"^‘ points. For a Rayleigh fading channel, 
the probability of pairwise error averaged over the fading statistics is given by [S]: 


^(qi ^ q2) < 


1 


1 + ||qi - q 2 |P 

Using the union bound the probability of error is bounded by: 




e < 


< 


1 


SNR’ 


E- 

q^O 


E 

qi7^q2 

1 

n \ |2nr ’ 


1 


1 + l|qi - q 2 | 


where 0 is the 2nt dimensional vector of zeros. The second step follows from the symmetry 
of the QAM. To compute the summation in on RHS, we split into a summation over vectors 
such that all its components are non-zero and then use the arithmetic mean-geometric mean 
(am/gm) inequality. We denote a subset of the index set, {1,2, ••• ,2nt}, by S. Then the 
summation can be simplified as 


E 

q^O 


1 


rrir 

SNR~ 


|q| 


I StT-t* 


E 


1 


SNR"- ^ (\i,\2 + 


+ 14. IT” 
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1 


rrir 

SNR ^ V ^ 


rnr 

SNR "t „ 1 

< —E E El- 


< 


SNR" 


SNR^ 

SNR”" 


S ijf^O:jeS j&S ^ 


i |2nr/|5| 


using am/gm inequality 


e(e^ 


|S| 


S VziT^O 


./|S| 


Since the range of snmmation |zi| is growing with SNR, the inner summation has different 
behavior for depending on whether IS"! is larger/smaller than 2nr- 




ilT^O 


l-2nr/\S\ 


if 2nr < I S'! 


= 1 otherwise 


But because of the definition of 5, liSI is naturally upper bounded by 2nt. Thns, for > nj, 
the probability of error can be upper bounded by: 


/ rrir 'i 

Pe < SNR"^ "t I. 

On the other hand, if < nt, then the probability of error can be upper bounded as: 

|S| 


(80) 


< 


< 


rnr 

SNR~ 

SNR”" 


E Eu 


1 


^^| 2 nr/|S| 


S \n7^0 

SNR^ ^ 


SNR" 


^ (snr"/2 ^*) 


2nr<\S\<2nt 


< 


max 


SNR "t 

SNR”" 2nr<\S\<2nt 
Snr-O"-") 


, \ \S\-2nr 

Snr"/2^M 


H Isotropic MIMO Channels 


We concentrate on the rotationally invariant distributions. For this class, the singnlar value 
distribution determines the channel statistics completely. Let /(</>) be the density function of 
the ordered squared singnlar values, (p, of the channel gain matrix. In terms of notation of 
Section [3 we have 

4>i = 'ipj for £ = 1 , • • • , min (n^, rii), 


where ^plS are the singular values of H. In the high SNR regime, we are only interested in the 
near zero behavior of 4>. Therefore, in the scaling of interest, / can be assumed to be of the 
form: 


/( 0 ) = <(-^1 •••</- 


^min(n 7 - ) -n- 

min(nr,nt 


(81) 
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This is same as the earlier definition of distribution of the squared singular values; 

^ \V^1 ^ tl) • • • ) Ymm(nr,nt) — ^min(nr,nt) J ~ ' ' ' ^min(nr,nt) ’ 

for ei < • • • < Cmin(nr,nt) • 

For Rayleigh fading distribution, 0 has the Wishart distribution which can be reduced to 
this polynomial form by ignoring the exponential terms in the Wishart distribution (for the 
exact expression, see P): 

r(.A) i <®) 

where ri = \nt — n^l + 2{£ — 1 ). 

In this appendix, first we characterize the outage curve in terms of kts for general /. Then, 
we use this characterization to characterize restricted universality for codes based on the V- 
BLAST and D-BLAST architecture proposed in Section and Section El respectively. 

H.l The outage curve for general fading distributions 

For a general fading distribution, F, we want to calculate the probability of outage. The outage 
event can be written as: 

min(nr,nt) 

log (1 + (/i^SNR) < r log SNR. 

£=l 

If we write 


(()^ = SNR-"0 (83) 

then the induced distribution (from (1811) 1 on the ordered vector a is 

pi^cx) = ( 84 ) 

which can obtained by change of variables (P|). The outage probability will be dominated by 
the OL that is on the boundary of outage and has smallest SNR exponent. More precisely, using 
Laplace’s method as in ^ , the outage curve is the solution to the optimization problem 

inf y^{ke + l)ai, (85) 

oteA' ^ 

where 

^ = < Q : ai > • • • > aminK,nt) > 0 and ^^^(1 - a0+ < r 

I i 

The fact that a^s are positive uses our assumption that the singular values have an exponential 
tail. Let’s assume for some integer s, s < r < s + 1. Then, if 

^ = 1,''' , min (n^, n*) - s - 1 (86) 

f = min (n^, n*) - s + 1, • • • , min (n^, n*), 
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then the optimizing cx in (IH51) is given by : 

0*1 = 1 for £ = 1, • • • , min (n^, nf) — s — 1 

“min(n^,nt)-s = ^ + 1 - r 

a| = 0 for £ = min (rer, nt) — s + 1, • • • , min (n^, nt). 

The corresponding outage curve is given by: 

min(nr,nt) 

dont{r) = {kmm{nr,nt)-s + '^){s + l-r)+ ^ + 1) for s < r < s + 1. 

£=min(nr )—s+1 

In particular, we would like to stress that if all the kts are increasingly ordered then the cx 
that dominates the outage probability for fading density / is the same one that dominates the 
outage probability for for i.i.d. Rayleigh fading. 


H.2 Restricted universality of V-BLAST and D-BLAST 

We want to prove that the simple QAM code for the V-BLAST architecture and codes based on 
using permutation codes over the D-BLAST architecture are universal over a class of isotropic 
fading distributions. We know that all these codes are tradeoff optimal for the i.i.d. Rayleigh 
fading channel under the union bound calculation. We exploit this fact to prove optimality over 
isotropic distributions that fade slower than i.i.d. Rayleigh fading. 

We denote the diagonal matrices with entries t/), the singular values of the the channel gain 
matrix, and A, the singular values of the codeword difference matrix, as 'J' and A. Then the 
probability of pairwise error averaged over the channel statistics can be written as (see ®): 


(Xa^Xb) = / Q 


/H 


IHDI 


dH 


(87) 


Q 


JVi 


I'J'ViUsAp 





Q 


# JVi 


|$V2ViU2A|P 


dVid^ 




where the last two steps use the independence of $ and Vi and rotational invariance of Vi 
respectively. The integral with respect to V is taken with respect to the Haar measure and 
does not depend on the distribution of $ and is only a function of the realization T' and the 
code. 

Now, the probability of error can be upper bounded using a union bound 


< 


SNR’ 


E 



Q 


^ JV 


'||$i/2va||2 


dVd^, 
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where the summation is over all possible codeword difference pairs. Since all the terms are 
positive, interchanging the order of the summation and integration the union bound can be 
written as 


Pe < 





The term inside the outer integral only depends on the code and the channel realization $ 
and not on the fading distribution. We denote it by g{cj}). Then the smart union bound can be 
written as 

Pe < / g{(t>)f{4>)dcf), 

J<p 

where / is the density function of cj). Similarly the upper bound corresponding to the smart 
union bound is given by 

Pe < P(W)+/ g{cj>)f{ct>)dcj>, (88) 

where 7i is the set of all channel realizations in outage. If we assume that the union bound is 
tight for Rayleigh fading, then it implies 



SNR-'^«W, 


where r(0) is the is density for the i.i.d. Rayleigh fading channel and d*^{r) is the corresponding 
outage curve. We use d*p{r) to denote the optimal curve for a generic density /. 

Then, for any / the second term in (P|) can be upper bounded as 




\cj>m J 


< 


/( 0 ) 

max—— 




(89) 


The expression to be maximized can be written as (see m and 


mm{nr^nt) min{nr,nt) 

max = min (/)“T (90) 


where ui = ri — ki Now, we consider the codes from Section [H] and El and explicitly compute 
the maximization (EUl) . 
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V-BLAST 


For the last segment of an re x n channel, none of the singular values can die completely (i.e., , 
SNR^ 


become less than ci^), therefore the no-outage condition can be written as; 


D' 

£=i 


> 


SNR’" 


- SNR" 


Therefore the minimization (inn|) can be written as 


D' 

n 0£>|^^=i 


mm 

, SNR’" 


h ’ 


(91) 


(92) 


with an additional constraint that the are bounded by one (using the exponential tail 
assumption). If we assume that ui is larger than U£ for every i >2, then 

111 


mm 


n 


n <i>i> 


mm 

SNR^ 


e=i 


< 


rr ^ \ SNR^ V=l 
11 W>SNR" 


max ](([ 
Ct> 1=2 


ui-ut 


If we assume that rei > 0 and rei — re^ > 0 for every i > 2, then the optimizing solution is 
given by: 

6 * = ^ 

SNR" 

= 1 for £ = 2,--- ,L. 

This optimal point is same as the point (in terms of oc), that optimized the outage probability 
calculation in (IS51) . Then, at the optimal point we can write: 


e=i 




n”=i' 


n n 

e=i(t>£ 


SNR'^flW ■ 

Therefore, using (15^) and the probability of error can be upper bounded by 

CMp—rfplr-) 


SNR'^flW 


< SNR-'^J’W. 


Thus, the code is also tradeoff optimal for the channel with fading density /, where / satishes 
the following conditions: 

ri > ki 

— ki > ri + 2{i — 1) — ki for i = 2, ■ ■ ■ , L. 

Combining these two conditions, we get 

ki -2{e-l) > ki for £ = 2, • • • , L. 

ki < 0 . 
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D-BLAST 


For the last segment of an ret x 2 channel, none of the singular values can fade completely (i.e., 
become less than 5 ]^), and hence the no-outage condition can be written as; 

M 2 > (93) 

This means that the minimization (|^ can be written as 

min (94) 

Now, this optimization problem is the same as the V-BLAST optimization problem in (El, 
with re = 2. Hence, the optimality condition on ki and k 2 turns out to be 

k2 — ki > 2 

fci < 0 
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